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Abstract 

We say that a first order sentence A defines a graph G if A is true on G 
but false on any graph non-isomorphic to G. Let L(G) (resp. D(G)) denote 
the minimum length (resp. quantifier rank) of a such sentence. We define the 
succinctness function s(n) (resp. its variant q(n)) to be the minimum L{G) 
(resp. D{G)) over all graphs on n vertices. 

We prove that s(n) and q(n) may be so small that for no general recursive 
function / we can have /(s(n)) > n for all n. However, for the function 
q*(n) = maxj< n (/(z), which is the least monotone nondecreasing function 
bounding q(n) from above, we have q*(n) = (1 + o(l))log*n, where log* n 
equals the minimum number of iterations of the binary logarithm sufficient to 
lower n to 1 or below. 

We show an upper bound q(n) < log* n + 5 even under the restriction of 
the class of graphs to trees. Under this restriction, for q(n) we also have a 
matching lower bound. 

We show a relationship D(G) > (1 — o(l)) log* L(G) and prove, using the 
upper bound for q(n), that this relationship is tight. 

For a non-negative integer a, let D a (G) and q a (n) denote the analogs of 
D{G) and q(n) for defining formulas in the negation normal form with at 
most a quantifier alternations in any sequence of nested quantifiers. We show 
a super recursive gap between Dq(G) and D%{G) and hence between Dq(G) 
and D(G). Despite it, for qo(n) we still have a kind of log-star upper bound: 
qo( n ) < 2 log* n + O(l) for infinitely many n. 
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1 Introduction 



We study sentences about graphs expressible in the laconic first order language with 
two relation symbols ~ and = for, respectively, the adjacency and the equality re- 
lations. The first order means that we are allowed to quantify only over vertices, 
in opposite to second order logic where we can quantify over sets of vertices. The 
difference between the first order and the second order worlds is essential. In the 
first order language we cannot express many basic properties of graphs, as the con- 
nectedness, the property of being bipartite etc (see, e.g., [24, theorems 2.4.1 and 
2.4.2]). On the other hand, the crucial for us fact is that the first order language is 
powerful enough to define any individual finite graph up to isomorphism. Indeed, 
a graph G with vertex set V{G) = {1, . . . ,n} and edge set E(G) is defined by the 
formula 



Etei . . . 3x n Vx n+ i ( f\ ~^( x i = x i) A V x n+i = x i 

l<i<j<n i<n 
A f\ Xi ~ Xj A f\ ->(Xi ~ Xj)j 



{i,j}&E{G) {i,ME(G) 

This fact, though very simple, highlights a fundamental difference between the finite 
and the infinite: There are non-isomorphic countable graphs satisfying precisely the 
same first order sentences (see, e.g., [24, theorem 3.3.2]). 

The question we address is how succinctly a graph G on n vertices can be defined 
by first order means. We consider two natural measures of succinctness — the length 
of a first order formula and its quantifier rank. The latter is the maximum number 
of nested quantifiers in the formula. Let D{G) be the minimum quantifier rank of a 
closed first order formula defining G, that is, being true on G and false on any other 
graph non-isomorphic to G. The sentence (1) ensures that D(G) < n + 1. This 
bound generally cannot be improved as D(G) — n + 1 for G being the complete or 
the empty graph on n vertices. However, for all other graphs we have D{G) < n. 
Thus, it is reasonable to try to lower the trivial upper bound of n + 1 to some 
u(n) < n and explicitly describe all exceptional graphs with D(G) > u(n). This is 
done in [19] with u(n) = n/2 + 0(1) (see also [20] for a generalization to arbitrary 
structures). More precisely, let us call two vertices of a graph similar if they are 
simultaneously adjacent or not to any other vertex. This is an equivalence relation 
and each equivalence class spans a complete or an empty subgraph. Let cr(G) denote 
the maximum number of pairwise similar vertices in G. Then, as shown in [19], 

a(G) + l <D(G) < max j^y^, (7(G) + 2j. 

It seems doubtful that results of this sort are possible to obtain with upper bound 
u(n) = cn + 0(1) for each constant c < 1/2. The known Cai-Furer-Immerman 
construction [2] gives graphs with linear D(G) that may serve as counterexamples 
to most natural conjectures in this direction. 

While the paper [19] addresses the definability of n- vertex graphs in the worst 
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case, in [12] we treat the average case. Let G be a random graph distributed uni- 
formly among the graphs with vertex set {1, . . . , n}. Then, as shown in [12], 

\D(G) - log 2 n\ = 0(log 2 log 2 n) 

with probability 1 — o(l). 

We now consider another extremal case of the graph definability problem. How 
succinct can be a first order definition of a graph on n vertices in the best case? 
Namely, we address the succinctness function q(n) defined as the minimum D(G) 
over n-vertex G. We also define L(G) to be the minimum length of a sentence 
defining G and s(G) to be the minimum L(G) over n-vertex G. Trivially, q(n) < 
s(n). Our first result is that s(n) and q(n) may be so small that for no general 
recursive function / we can have f(s(n)) > n for all n. 

The proof is based on simulation of a Turing machine M by a first order for- 
mula A M in which a computation of M determines a graph satisfying A M and vise 
versa. Such techniques were developed in the classic research on Hilbert's Entschei- 
dungsproblem by Turing, Trakhtenbrot, Biichi and other researchers (see [1] for 
survey and references). The novel feature of our simulation is that it works if we 
restrict the class of structures to graphs. The key ingredient of our proof is a gadget 
allowing us to impose an order relation on the vertex set of a graph. 

As a by-product, we obtain another proof of Lavrov's result [14] that the first 
order theory of finite graphs is undecidable. Our proof actually shows the undecid- 
ability of the V*3 p V s 3*-fragment of this theory for some p, s, and t. 

From the fact that q(n) and n are not recursively linked, it easily follows that, if a 
general recursive function l(n) is monotone nondecreasing and tends to the infinity, 
then 

q(n) < l(n) for infinitely many n. (2) 
Our next result establishes a general upper bound 

q(n) < log* n + 5 for all n. (3) 

Here log* n equals the minimum number of iterations of the binary logarithm suffi- 
cient to lower n below 1. It turns out that this is the best possible monotonic upper 
bound for q(n). Let q*(n) = maxj<„ q(i), which is the least monotone nondecreasing 
function bounding q{n) from above. We prove that 

q*(n) > log* n -log* log* n- 0(1). (4) 

As the upper bound (3) is monotonic, we obtain 

q*(n) = (l + o(l)) log* n. (5) 

Comparing (5) to (2) with l(n) = log*n, we conclude that q(n) infinitely often 
deviates from its "smoothed" version q*(n) and, in particular, is essentially non- 
monotonic. 

Proving (3) and (4), we use a robust technical tool given by the Ehrenfeucht 
game [5] (these techniques were also developed by Fraisse [7] in a different setting). 
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As a matter of fact, we prove the upper bound (3) under the restriction of the 
class of graphs to trees only, that is, we have q(n) < q(n; trees) < log* n + 5. Recall 
that, by (2), q(n) is infinitely often so small that we cannot bound it from below 
by any "regular" function. The proof of this fact cannot be carried through for 
q(n; trees) because, as a well-known corollary of the Rabin theorem [21], the first 
order theories of both all and finite trees are decidable and hence a Turing machine 
computation cannot be simulated by a first order sentence about trees. In fact, for 
q(n; trees) we establish a matching lower bound thereby determining this function 
asymptotically, namely, 

q(n; trees) = (1 + o(l)) log* n. 

We pay a special attention to defining sentences having a restricted structure. 
For a non-negative integer a, let D a (G) and q a {ji) denote the analogs of D{G) and 
q(n) for defining formulas in the negation normal form with at most a quantifier 
alternations in any sequence of nested quantifiers. The superrecursive gap between 
s(n) and n is actually shown even under the restriction of the alternation number to 
3. Note also that, as follows from a result in [12], g 3 (n) < log* n + 0(1) and hence 
(5) holds with alternation number 3. 

On the other hand, we show a superrecursive gap between Dq(G) and D^(G) 
and hence between Dq(G) and D(G). Despite it, for qo(n) we also have a kind of 
log-star upper bound: qo{n) < 21og*n + 0(1) for infinitely many n. It is worth 
noting that this is not the first case that we have close results for the alternation 
number and for the unbounded alternation number. In [12] we prove that for 
a random graph D(G) and Dq(G) are not so far apart from each other. Namely, 
Dq(G) < (2 + o(l))log 2 n with probability 1 — o(l). Yet another result showing 
the same phenomenon is obtained in [19]. Given non-isomorphic graphs G and G', 
let D{G,G') (resp. D (G,G')) denote the minimum quantifier rank of a sentence 
(resp. in the negation normal form with no quantifier alternation) which is true on 
exactly one of the graphs. As shown in [19], if both G and G' have n vertices, 
then D(G,G f ) < Do(G,G f ) < (n + 5)/2 and there are simple examples of such G 
and G' with D{G, G') > (n + l)/2. Note that logical distinguishing non-isomorphic 
graphs with equal number of vertices has close connections to graph canonization 
algorithms (see, e.g., [2, 8, 19] and a monograph [10]). 

Relating D(G) and L(G) to one another, we show that 

J D(G)>(l-o(l))log*L(G). 

Using the bound (3), we show that this relationship is tight. 

Focusing on defining formulas of restricted structure, we also consider prenex 
formulas. A superrecursive gap between s(n) and n can actually be shown under the 
restriction to this class. Nevertheless, prenex formulas generally are not competetive 
against defining formulas with no restriction on structure. We observe that graphs 
showing a huge gap between D{G) and L(G) at the same time show a huge gap 
between D{G) and its version for prenex defining formulas. 

In conclusion, note that all of our results carry over to general structures over any 
relational vocabulary with at least one non-unary relation symbol. For the upper 
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bounds this claim is straightforward because graphs can be viewed as a subclass of 
such structures which is distinguishable by a single first order sentence. The lower 
bounds hold true with minor changes in the proofs. 

2 Background 

2.1 Arithmetics 

We define the tower function T{i) by T(0) = 1 and T(i) = 2 T( ^~ 1 ) for each subsequent 
i. Sometimes this function will be denoted by Tower (i). Given a function /, by /W 
we will denote the i-fold composition of /. In particular, f^°\x) = x. By logn we 
always mean the logarithm base 2. The inverse of the tower function, the log-star 
function log* n, is defined by log* n = min {i : T(i) > n}. For a real x, the notation 
\x] (resp. L^J) stands for the integer nearest to x from above (resp. from below). 

2.2 Graphs 

Given a graph G, we denote its vertex set by V(G) and its edge set by E(G). The 
order of G, the number of vertices of G, will be sometimes denoted by that is, 
|Gr| = |V(Gr)|. The neighborhood of a vertex v consists of all vertices adjacent to v. 
A set S C V(G) is called independent if it contains no pair of adjacent vertices. If 
X C V(G), then G[X] denotes the subgraph induced by G on X (or spanned by X 
in G). Hue V(G), then G — u — G[V{G) \ {u}] is the result of removal from G the 
vertex u along with all incident edges. 

The distance between vertices u and v, the minimum length of a path connecting 
the two vertices, is denoted by d(u,v). If u and v are in different connected com- 
ponents of a graph, then d(u, v) = oo. The eccentricity of a vertex v is defined by 
e(v) = max ue y( G ) d(v,u). The diameter and the radius of a graph G are defined by 
d(G) = max ve v(G) e ( v ) an d r (G) = mm v€V (G) e ( v ) respectively. A path in a graph 
is diametral if its length is equal to the diameter of the graph. A vertex v is central 
ife(u)=r(G). 

Proposition 2.1 [17, Theorem 4.2.2] Let T be a tree. If d(T) is even, then T 
has a unique central vertex c and all diametral paths go through c. If d(T) is odd, 
then T has exactly two central vertices C\ and c 2 and all diametral paths go through 
the edge {c 1 , c 2 }. 

2.3 Logic 
2.3.1 Formulas 

First order formulas are assumed to be over the set of connectives {->, A, V}. A 
sequence of quantifiers is a finite word over the alphabet {3, V}. If S is a set of such 
sequences, then 3S (resp. \/S) means the set of concatenations 3s (resp. Vs) for all 
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s G S. If s is a sequence of quantifiers, then s denotes the result of replacement of 
all occurrences of 3 to V and vise versa in s. The set S consists of all s for s G S. 

Given a first order formula A, its set of sequences of nested quantifiers is denoted 
by Nest(A) and defined by induction as follows: 

1) Nest(A) = {e} if A is atomic; here e denotes the empty word. 

2) Nest(^A) = Nest (A). 

3) Nest{A A B) = Nest{A V 5) = Nest(A) U Nest(B). 

4) IVestprrA) = 3Nest(A) and Nest(VxA) = VJVest(A). 

The quantifier rank of a formula A, denoted by qr(A) is the maximum length of a 
string in Nest (A). 

We adopt the notion of the alternation number of a formula (cf. [18, Definition 
2.8]). Given a sequence of quantifiers s, let alt(s) denote the number of occurrences 
of 3V and V3 in s. The alternation number of a first order formula A, denoted 
by alt(A), is the maximum edt(s) over s G Nest(A). The alternation number has 
an absolutely clear meaning for formulas in the negation normal form, where the 
connective -i occurs only in front of atomic subformulas. This number is defined for 
any formula A so that, if A is reduced to an equivalent formula A' in the negation 
normal form, then alt(A) = alt(A'). 

Viewing a formula A as a string of symbols over the countable first order alphabet 
(where each variable and each relation is denoted by a single symbol), we denote the 
length of A by \A\. Note that if one prefers, in a natural way, to encode variable and 
relation symbols in a finite alphabet, then the length will increase but stay within 
\A\ log \A\. 

We call A an 3-formula (resp. V '-formula) if any sequence in Nest (A) with max- 
imum number of quantifier alternations starts with 3 (resp. V). We denote the set 
of formulas in the negation normal form with alternation number at most m by A m . 
By A^ (resp. A^J we denote the subset of A m consisting of formulas in A m _! and 
3-formulas (resp. V-formulas) in A m \ A m _i. We will call formulas in Aq and Aq 
existential and universal respectively. 

A prenex formula is a formula with all its quantifiers up front. In this case there 
is a single sequence of nested quantifiers and the quantifier rank is just the number 
of quantifiers occurring in a formula. Let Si and IL denote, respectively, the sets 
of existential and universal prenex formulas. Furthermore, let S m (resp. Il m ) be the 
extension of S m _i U Il m _i with prenex formulas in A^_ x (resp. A^ l _ 1 ). Note that 
the classes of formulas A m , A^, A^, E m , and Il m are defined so that they are closed 
with respect to subformulas. 

The following lemma is an immediate consequence of the standard reduction of 
a formula to the prenex form. 

Lemma 2.2 The conjunction of Y.m-formulas (resp. Tim-formulas) is effectively re- 
ducible to an equivalent Y, m -formula (resp. Tl m -formula) . The same holds for the 
disjunction. ■ 
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We write A = B if A and 5 are logically equivalent formulas and A = B if A 
and B are literally the same. 

Lemma 2.3 

1) Any formula in is effectively reducible to an equivalent formula in S m +i. 

2) Any formula in A^ is effectively reducible to an equivalent formula in U m +i- 

3) Any formula in A m is effectively reducible to an equivalent formula in S m+ 2 
or, as well, to an equivalent formula in U m +2- 

Proof. Item 3 follows from Items 1 and 2 as A m is included both in A^ +1 and A^ +1 . 
To prove Items 1 and 2, we proceed by induction on to. 

Consider the base case of to = 0. Assume that A G Aq and let t = t(A) denote 
the total number of quantifiers and connectives A, V in A. We prove that A has an 
equivalent formula A' G Si using induction on t. If t = 0, then A is quantifier-free 
and hence in S . Let t > 1. Assume that A = 3x_B. Since t(_B) = t(A) — 1, the 
assumption of induction on t applies to B. Therefore B reduces to an equivalent 
formula 5' G Si and we set A' = 3xB'. Assume that A = B A C (the case that 
A = B V C is similar). Neither of t(-B) and i(C) exceeds t(A) — 1 and, by the 
assumption of induction on t, for B and C we have equivalents B' and C in Si. 
Then A = B' A C" reduces to an equivalent in Si by Lemma 2.2. 

The reducibility of A^ to IIi is proved similarly. 

Let m > 1 and assume that Items 1 and 2 of the lemma are true for the preceding 
value of m. Given A e A^, we show how to find an equivalent formula A' G S m+ i 
(the reduction of A^ to U m +i is similar). We again use induction on t = t(A). If 
t = 0, then A is in S . Let t > 1. If A = VrrE, then A e A^_ : and, by the 
assumption of induction on to, A has an equivalent A' e II m C S m+1 . If A = 3x_B, 
A = B A C, or A = 5 V C, then B, C G A^ and both t(B) and t(C) are smaller 
than t(A). We are done by the assumption of induction on t and Lemma 2.2. ■ 

A formula with all variables bound is called a closed formula or a sentence. 

Lemma 2.4 If A is a closed prenex formula of quantifier rank q with occurrences 
of h binary relation symbols, then it can be rewritten in an equivalent form A' with 
the same quantifier prefix so that \A'\ — 0(hq 2 2 hq ). 

Proof. Let B(x 1 , . . . , x q ) be the quantifier-free part of A. The B is a Boolean 
combination of to = h^fj atomic subformulas and hence is representible as a DNF 
of length 0(m2 m ). ~ - 



8 



2.3.2 Structures 



A relational vocabulary a is a finite set of relation symbols augmented with their 
arities. We always assume the presence of the binary relation symbol = standing for 
the equality relation and do not include it in a. The only exception will be Subsection 
5.4 where the presence or the absence of equality will be stated explicitly. 

A structure over vocabulary o (or an u-structure) is a set along with relations 
that are named by symbols in a and have the corresponding arities. We mostly 
deal with the vocabulary of a single binary relation symbol. A structure over this 
vocabulary can be viewed as a directed graph (or digraph). We treat graphs as 
structures with a single binary relation which is symmetric and anti-relexive. This 
relation will be called the adjacency relation and denoted by ~. 

If all relation symbols of a sentence A are from the vocabulary o and G is an 
c-structure, then A is either true or false on G. In the former case G is called a 
model of A. We also say that G satisfies A. We call A valid if all cr-structures satisfy 
A. We call A (finitely) satisfiable if it has a (finite) model. Clearly, A is valid iff -<A 
is unsatisfiable. 

2.3.3 Computability 

Whenever we say that something can be done effectively, we mean that this can 
be implemented by an algorithm. No restrictions on running time or space are 
assumed. Professing Church's thesis, we here do not specify any definition of the 
algorithm. Nevertheless, we will refer to Turing machines (see Subsection 4.2.1) and 
recursive functions in Sections 4 and 5. As a basic fact, these two computational 
models are equally powerful, under an effective bijection between binary words and 
non-negative integer numbers. 

Let X be a set of words over a finite alphabet. The decision problem for X is 
the problem of recognition whether or not a given word belongs to X. If there is an 
algorithm that does it, the decision problem is solvable (or X is decidable). 

The halting problem is the problem of deciding, for given Turing machine M and 
input word w, whether M eventually halts on w or runs forever. This is a basic 
unsolvable problem. It is well known that, if we fix w to be the empty word, the 
restricted problem remains unsolvable. 

The (finite) satisfiability problem is the problem of recognizing whether or not 
a given sentence is (finitely) satisfiable (we here assume any natural encoding of 
formulas in a finite alphabet). Settling Hilbert's Entscheidungsproblem, Church and 
Turing proved that the satisfiability problem is unsolvable. The unsolvability of the 
finite satisfiability problem was shown by Trakhtenbrot [25]. 

A general recursive function is an everywhere defined recursive function. 

2.3.4 The Bernays-Schonfinkel class of formulas and the Ramsey theo- 
rem 

A class of formulas has the finite model property if every satisfiable formula in the 
class has a finite model. By the completeness of the predicate calculus with equality, 
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the set of valid sentences is recursively enumerable. From here it is not hard to 
conclude that, if a class of formulas has the finite model property, the satisfiability 
and the finite satisfiability problems for this class are solvable. 

The Bernays-Schdnfinkel class consists of prenex formulas in which the existen- 
tial quantifiers all precede the universal quantifiers, that is, this is another name 
for £ 2 . 

Proposition 2.5 (The Ramsey theorem [22] 1 ) For each vocabulary a there is 
a general recursive function f : N — > N such that the following is true: Assume that 
a a -sentence A with equality is in the Bernays-Schdnfinkel class. If A has a model 
of some cardinality at least f(qr(A)) (possibly infinite), then it has a model in every 
cardinality at least f(qr(A)). As a consequence, the Bernays-Schdnfinkel class of 
formulas with equality has the finite model property and hence both the satisfiability 
and the hnite satishability problems restricted to this class are solvable. 

2.3.5 Definability 

Let G and G' be non-isomorphic graphs and A be a first order sentence with equality 
over vocabulary {~}. We say that A distinguishes G from G' if A is true on G but 
false on G' . By D(G, G') (resp. D^(G, G')) we denote the minimum quantifier rank 
of a sentence (with alternation number at most k resp.) distinguishing G from G' . 

We say that a sentence A defines a graph G (up to isomorphism) if A distin- 
guishes G from any non-isomorphic graph G' . To ensure that A has no other models 
except graphs, we will tacitly assume that A has form A = W x (x x A s i y {x ~ y — > 
y ~ x)) A B. By D(G) (resp. D a (G)) we denote the minimum quantifier rank of 
a sentence defining G (with alternation number at most a resp.). By L{G) (resp. 
L a (G)) we denote the minimum length of a sentence defining G (with alternation 
number at most a resp.). 

A sentence is called defining if it defines a graph. Note that any defining sentence 
must contain the equality symbol. Let us stress that graphs G' in the above definition 
may have any cardinality. 

Lemma 2.6 All finite graphs and only finite graphs posses defining sentences. 

Proof. Any finite graph is indeed definable as it has at least the wasteful defini- 
tion (1). By the Upward Lowenheim-Skolem theorem (see [16, corollary 2.35]), if a 
sentence with equality has an infinite model, it has a model of any infinite cardi- 
nality. By this reason, no infinite graph has defining sentence in the sence of our 
definition. ■ 

Lemma 2.7 The class of defining Af-sentences is decidable. 

Proof. Suppose that we are given a sentence A e A±. By Lemma 2.3 (1), we can 
reduce it to an equivalent formula in the Bernays-Schdnfinkel class and apply the 
Ramsey theorem. We are able to recognize if A is defining in four steps. 

1 The combinatorial Ramsey theorem, a cornerstone of Ramsey theory, appeared in this paper 
as a technical tool. 
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1) Check if A is finitely satisfiable. 

2) If so, trying graphs one by one, we eventually find a graph of the smallest order 
n satisfying A (this is actually done in the first step, if it is based directly on 
the Ramsey theorem). 

3) Check if there is any other graph of order n satisfying A. 

4) If not, check if a A^-sentence A A 3 Xl ,...,x n+1 (/\i<i<j<n+i x i x j) is satisfiable. 
If not, and only in this case, A is defining. ■ 



3 The Ehrenfeucht game 

In this section we borrow a lot of material from [24, section 2]. To make our ex- 
position self-contained, we sketch some proofs that can be found in [24] in more 
detail. 

The Ehrenfeucht game is played on a pair of structures of the same vocabulary. 
We give the definition conformable to the case of graphs. 

Let G and H be graphs with disjoint vertex sets. The /c-round Ehrenfeucht game 
on G and H, denoted by EHR fc (G, H), is played by two players, Spoiler and Dupli- 
cator, with k pairwise distinct pebbles Pi, ■ ■ ■ ,Pk, each given in duplicate. Spoiler 
starts the game. A round consists of a move of Spoiler followed by a move of Du- 
plicator. At the i-th move Spoiler takes pebble Pi, selects one of the graphs G or 
H, and places pi on a vertex of this graph. In response Duplicator should place the 
other copy of pi on a vertex of the other graph. It is allowed to place more than one 
pebble on the same vertex. 

Let Ui (resp. v{) denote the vertex of G (resp. H) occupied by Pi, irrespectively 
of who of the players placed the pebble on this vertex. If 

Ui = Uj iff Vi = Vj for all 1 < i < j < k, 

and the component-wise correspondence (u±, . . . , u^) to (v±, . . . , v^) is a partial iso- 
morphism from G to H, this is a win for Duplicator; Otherwise the winner is Spoiler. 

The a- alternation Ehrenfeucht game on G and H is a variant of the game in 
which Spoiler is allowed to switch from one graph to another at most a times during 
the game, i.e., in at most a rounds he can choose the graph other than that in the 
preceding round. 

Let < s < k, r = k — s, and assume that at the start of the game the 
pebbles pi, . . . ,p s are already on the board at vertices u — u±, . . . , u s of G and 
v — i>i, . . . , v s of H. The r-round game with this initial configuration is denoted by 
EHR r (G, u, H, v). We write G, u=kH,v if Duplicator has a winning strategy in this 
game. 

It is not hard to check that =k is an equivalence relation. The /c-Ehrenfeucht 
value of a graph G with vertices u±, . . . ,u s marked by pebbles is the equivalence 
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class it belongs to under = k . We let Ehrv(k,s) denote the set of all possible k- 
Ehrenfeucht values for graphs with s marked vertices. Let Ehrv(k) = Ehrv(k,0) 
denote the set of /c-Ehrenfeucht values for graphs (with no marked vertex). 

Lemma 3.1 Assume that s < k. Let u — u±, . . . ,u s and S(G, u) denote the set of 
= k -equivalence classes of G with s+1 marked vertices u, u for all u G G\{ui, . . . , u s }. 
Then G,u= k H,v iff S(G,u) = S(H,v). 

Proof. Consider the game EHR fc _ s (G, u, H, v). Suppose that S(G,u) ^ S(H,v), 
for example, there is u G V(G) such that G,u,u ^ k H,v,v for any v G V(H). Let 
Spoiler select this u and let v denote Duplicator's response. From now on the players 
actually play EHR fe _ s _!(G, u, u, H, v, v), where Spoiler has a winning strategy. 

Suppose that S(G,u) = S(H,v). If Spoiler selects, for example, a vertex u G 
V(G), then Duplicator responds with v G V(H) such that G,u,u = k H,v,v and 
hence has a winning strategy in the remaining part of the game. ■ 



Lemma 3.2 [24, theorem 2.2.1] For any s and k, Ehrv{k, s) is a finite set. Fur- 
thermore, let f(k,s) = \Ehrv(k, s)\. Then 

f(k, k) < 4("), (6) 
f(k,s) < 2 / ( fc < s+1 ) (7) 

for s < k. 

Proof. The bound (6) holds because the ^-equivalence class of G with marked 
Mi, . . . , u k is determined by the equality relation on the sequence ui, . . . , u k and the 
induced subgraph G[{ui, . . . , u k }]. The bound (7) holds because the ^-equivalence 
class of an arbitrary G with marked u = ui,...,u s is, according to Lemma 3.1, 
determined by S(G, u), a subset of Ehrv (k, s + 1). ■ 

As a consequence, we obtain the following bound. 

Lemma 3.3 [24, theorem 2.2.2] \Ehrv{k)\ < T(k + 2 + log* k) + 0(1). ■ 

We say that a formula A(xi, . . . , x s ) with s free variables defines an Ehrenfeucht 
value a G Ehrv(k, s) if A is true on a graph G with variables Xi, . . . ,x s assigned 
vertices ui, . . . ,u s for exactly those G, ui, . . . , u s which are in a. 

Lemma 3.4 [24, theorem 2.3.2] For any a G Ehrv(k ) s) there is a formula A a 
with qr(A a ) = k — s that defines a. Moreover, 

\A a \ < 18 Q if s = k and (8) 
< f(k, s + 1) (max { | A p \ : f3 G Ehrv (k, s + 1)} + 10) if s < k. (9) 
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Proof. The bound (8) holds because every a G Ehrv(k, k) is defined by a formula 
of the type 

f\ (*(xi = Xj) A*(xi ~ Xj)), 

l<i<j<k 

where * and * is -> for some of and nothing for the others, depending on 

adjacencies among the marked vertices of a G, ui, . . . , Uk in a. 

Let s < k and assume that every (5 G Ehrv(k,s + 1) has a defining formula 
Ap(xi, . . . ,x s ,x) of quantifier rank k — s — 1. Consider an a G Ehrv(k,s) and 
choose a representative G, u of a. Define S(a) = S(G, u), where the right hand side 
is as in Lemma 3.1. By this lemma, the definition does not depend on a particular 
choice of G,u. We set 

A a (x 1: . . . ,x 9 ) = f\ %Ap(x 1 ,...,x a ,x)A f\ ^3 x Ap(x 1: ...,x s ,x). 

It is clear that G with designated u — u±, . . . , u s satisfies A a iff the set of Ehrenfeucht 
values with additional designated u is equal to S(a). By Lemma 3.1, the latter 
condition is true iff G, u has Ehrenfeucht value a. ■ 

Proposition 3.5 Suppose that G and H are non- isomorphic graphs. 

1) Let R(G, H) denote the minimum k such that G and H have different k- 
Ehrenfeucht values. Then D(G, H) = R(G, H). In other words, D(G, H) 
equals the minimum k such that Spoiler has a winning strategy in EHR fe (G, H). 

2) D a (G,H) equals the minimum k such that Spoiler has a winning strategy in 
the a-alternation EHRfc(G, H). 

We refer the reader to [24, Theorem 2.3.1] for the proof of the first claim and to [18] 
for the second claim. 

Proposition 3.6 

D(G) = max { D(G, H) : H and G are non-isomorphic} , 
D a (G) = max{_D a (G, H) : H and G are non-isomorphic} . 

The first equality can he restated as follows: D(G) equals the minimum k such that 
the k-Ehrenfeucht value of G contains only graphs isomorphic to G. 

Proof. We give a proof of the first equality that can be easily adopted for the second 
equality. Denote the maximum in the right hand side by k. We have k < D(G) as 
a matter of definition. Conversely, let a G Ehrv(k) be the class containing G. By 
Proposition 3.5, G is, up to isomorphism, the only member of a. For each (3 ^ a 
in Ehrv(k), fix a representative Hp. Let Cp be a sentence of quantifier rank at 
most k distinguishing G from Hp. We use Lemma 3.2 saying that Ehrv(k) is finite. 
The conjunction of all Cp defines G and has quantifier rank k. Thus, D(G) < k. 
(Alternatively, we could use the known fact that, over a finite vocabulary, there are 
only finitely many inequivalent sentences of bounded quantifier rank, cf. Lemma 
5.6.) - 
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4 A superrecursive gap: Simulating a Turing ma- 
chine 



Definition 4.1 We define the succinctness function s(n) (for formula length) by 

s(n) = min L{G). 

\G\=n 

The variants with bounded alternation number are defined by 

s a (n) = min L a (G) 

\G\=n 

for each a > 0. 



It turns out that s(n) can be so small with respect to n that the gap between 
the two numbers cannot be bounded by any recursive function. 

Theorem 4.2 There is no general recursive function f such that 

f(s 3 (n)) > n for all n. (10) 

Lemma 4.3 (Simulation Lemma) Given a Turing machine M with k states, one 
can effectively construct a sentence A M with single binary relation symbol ~ and 
equality so that the following conditions are met. 

1) qr(A M ) =k + 16. 

2) \A M \ = 0(k 2 ). 

3) alt(A M ) = 3. 

4) Am is effectively reducible to an equivalent prenex formula Pm whose quantifier 
prefix has length k + 0(1), begins with k existential quantifiers, and has 3 
quantifier alternations. 

5) Any model of A M is a graph. If M halts on the empty input word, then A M 
has a unique model Gm and the order of Gm is bigger than the running time 
ofM. 

6) M halts on the empty input word iff A M has a finite model. 

Proof of Theorem 4.2. Let g(k) denote the longest running time on the empty 
input word e of a fc-state Turing machine (non-halting machines are excluded from 
consideration). Recognition whether or not a given Turing machine with k states 
halts on e easily reduces to computation of g(k). As this variant of the halting 
problem is well known to be undecidable, the function g(k) cannot be bounded from 
above by any general recursive function. For each k, fix a machine with k states 
whose running time attains g(k). Let A Mk be as in the Simulation Lemma, be 
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the model of A Mk , and n k be the order of G k . Let l(k) = ck 2 be the upper bound 
for \A Mk \ ensured by the lemma. Note that A Mk defines G k . 

Suppose on the contrary that (10) is true for some general recursive /. Since 
s^ijik) < |AmJ < l(k), for every k we have 

g(k) <n k < f(s 3 (n k )) < max/(i), 

i<l(k) 

a contradiction. ■ 
The proof of the Simulation Lemma takes the rest of this section. 



4.1 Gadgets 

We enrich our language with connectives — > and <-> for the implication and the 
equivalence. Since the alternation number was defined for formulas with connectives 
-i, A, V, we should stress that — > and <-> are used as shorthands for their standard 
definitions through -i, A, V. We introduce the new uniqueness quantifier 3! by 

3\ x F(x) = 3 x F(x) A y x y y (F(x) A F(y) - x = y) 

for any formula F with a free variable x and with no free occurrences of y. Note 
that one occurrence of the uniqueness quantifier contributes 2 in the quantifier rank 
and 1 in the alternation number. We use relativized versions of the existential and 
the universal quantifiers in the standard way: 

3 c{x) F(x) = 3 x (C(x) A F(x)), 

y C ( X) F(x) = y x (c(x)^F(x)). 

To ensure that any model of A M is a graph, we put in A M the two graph axioms 
(the irreflexivity and the symmetry of the relation ~). 

4.1.1 Ordering 

We give a formula P(x,x') with two free variables x and x' that, in any model, 
shall determine an order on the neighborhood of x. Let X = {y : y ~ x} and 
X' = {z : z ~ x'}. Then P(x,x') is the conjunction of the following: 

(PI) {x, x'}, X, X' are all disjoint and each of them is independent. 

(P2) V ye x3 ze x> y~z 

(P3) B yex 3\ zex , y~z 

(P4) 3 yeX V z ex> y~z 

(P5) y yi ex^ y2 ex^zex'(yi ~ z -> y 2 ~ z) V V zeX '(y 2 ~ z -> y x ~ z)] 
(P6) V^exV^exfyi 7^ 1/2 ^ 3 2eX /(y 1 ~ z <-> y 2 / z)] 
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(P7) Vyexfizex' V z -> 3 y+eX 3\ zeX ,(y + ~zAt/^z)] 
(P8) V^xp!^ j/~zV V^U^y ~ * A y~ + z)\ 

Note that qr(P) = 4, aJt(P) = 2 (contributed by (P7) and (P8)), and \P\ = 0(1). 

Consider finite models of P(x,x'). For y e X let N*(y) be those z G X' ad- 
jacent to y. The N*(y) are distinct (P6), linearly ordered under inclusion (P5), 
are nonempty (P2), include a singleton (P3) and all of X' (P4), and the set of all 
cardinalities |JV*(y)| has no gaps (either (P7) or (P8)). So we must have |X| = \X'\ 
and the elements can be ordered X i ; • • • i X g - X . • • • . X 1 _ so that adjacent 
precisely when j < i. We induce on X a binary relation < defined by 



In any model (even infinite) the properties (P1)-(P8) assure that < is a linear 
order with a least and greatest element. Furthermore, every y has a successor y + 
and a predecessor y~ except when y is the last or first element of X respectively. 

4.1.2 Coordinatization 

We now give a formula COOR(x, x' ' , t, t', z) that shall coordinatize the neighborhood 
of z. Let X, X', T, T', Z denote the neighborhoods of x, x', t, t' ', z respectively. Then 
COOR is the conjunction of the following: 

(CI) x,x,' ,t,t',z,X,X',T,T',Z are all disjoint. Z is an independent set. All 
neighbors of Z are in {z} U X U T. There is no edge between X U X' and 
TUT'. 

(C2) P(x,x') AP(M') 

(C3) V^^Ux* ~iA 3! teT ,2 ~ t) 

(C4) V xeX V t6T 3Uz(,z ~ x A * ~ t) 

Thus, each z e Z has a unique pair of coordinates (x, t) and each (x, t) corresponds 
to a unique z. Note that qr(COOR) = qr(P) = 4 and alt(COOR) = alt(P) = 2. 

4.1.3 New functional and constant symbols 

To facilitate further description of Am, we will use new functional symbols. In 
particular, this will allow us to have new constant symbols as symbols of miliary 
functions. 

Writing v, we will mean a finite sequence of variables i>i,i>2, • • •• As soon as a 
statement ~tfy3\xF(x, y) is put in Am or is derivable from what is already put in Am, 
we may want to denote this unique x by <p(y) and use as a new functional symbol 
in the standard way. Namely, if Q(u, z) is a formula with free variables u, z, then 



V\ < 2/2 = Vzex>(y 



'i ~ z — > y 2 ~ ^). 



3x(F(x, y) A z)) or 

Vx(F(x,y)^g(x,z)). 
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Both variants are admissible and an appropriate choice of one of them may reduce 
the alternation number of a formula. Furthermore, in this way we can express 
compositions of several functions (e.g. [16, section 2.9]). 

In particular, in any model of COOR(x, x', t, t', z) we let 1, 2 denote the first two 
elements of X (under <) and (it will represent time zero) the first element of T. 
The same character uj will be used for the last element of X or T, dependent on 
context. For v in X or T, v~ and v + are respectively its predecessor and successor 
(when defined). The notation (x,t) will be used as a binary function symbol with 
meaning as explained in the preceding subsection. 

4.2 Capturing a computation by a formula 

4.2.1 Definition of a Turing machine 

By technical reasons, we prefer to use the model of a Turing machine where the tape 
is infinite in one direction. It is known (e.g. [13, section 41]) that it is equivalent to 
the model with the tape infinite in both directions. At the start the tape consists of 
the special "Left End of Tape" symbol L, followed by an input word written down 
in the binary alphabet {a,b}, and followed onward by all "blank" symbols B. A 
symbol occupies one cell. Let s\, . . . , Sk be states of a Turing machine M, with si 
the inital state and Sk the final state. At the start M is in state s\ and its head is 
at the first B. A machine is defined by a set of instructions of the following type, 
where a, f3 G {L, a, b, B}. 

Siafisji If in state s, reading a symbol a, overwrite f3 and go to state Sj. 

Sia Rights y. If in state Sj reading a symbol a, move the head one cell to the right 
and go to state Sj. 

Sia Left Sji If in state reading a symbol a, move the head one cell to the left and 
go to state sj. 

If a = L in an instruction of the first type, then f3 = L. This is the only case when 
f3 = L. There is no instruction of the third type ("move to the left") for a = L. 
With this exception, for every i < k and a there is a unique instruction what to do 
in state Sj reading a. The machine halts immediately after coming to state s&. If M 
halts, its running time is the number of instructions executed before termination. 

4.2.2 Formula A M 

For notation simplicity, we use the same name for variables and corresponding se- 
mantical objects (ingredients of M and vertices of a graph Gm)- The vertex H 
below shall be used to keep track of the tape header. Am is the conjunction of the 
two graph axioms and a long formula of the form 

^x,x',t,t',z, S1 ,...,s k ,a,b,B,L,HBM(x, x ,t,t , Z, S 1 , . . . , s k , a, b, B, L, H). 
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The formula B M whose all free variables are listed above is the conjunction of the 
following subformulas, where X, X', T, T', Z denote, as before, the neighborhoods of 
x, x', t, t', z respectively. 

(Al) x,x' ,t,t' , z, si, . . . , Sk,a,b, B, L, H, X, X' ,T,T' , Z are disjoint and consist of 
all the vertices of the graph. 
(A2) COOR(x,x',t,t',z) 

(A3) All of the neighbors of a, b, B, L, H are in Z. 

(A4) For all x G X and t G T the vertex (x,t) is adjacent to precisely one of 
a,b,B,L. We will write VAL(x, t) for this value, which represents the symbol on 
the Turing Machine at position (cell of the tape) x and time t. Note that, as 
VAL(x,t) ranges over four possible values L,a,b,B, using this functional symbol 
requites no extra quantification. For example, the formula VAL(x, t) = a reads just 
(x, t) ~ a. 

(A5) All neighbors of H are in Z. For all t G T there is a unique x G X for 
which (x,t) is adjacent to H. We write HP{t) for this x, which represents the 
header position. Thus, HP(t) = x reads (x,t) ~ H. We shall write VAL(t) = 
VAL(HP(t),t), the symbol that the header is looking at time t. If HP is used within 
VAL, it takes one extra quantifier. Note that a subformula VAL(t) = a has quantifier 
rank 2 and alternation number 0. Furthermore, VAL(t + ) = a has quantifier rank 4 
and can be written with alternation number 0. 

(A6) The neighbors of si, . . . , are all in T. For all t G T precisely one of si, . . . , 
is adjacent to t. We write ST(t) for this s,, which represents the state at time t. 
Note that ST(t) = s = t ~ s. 

We want the Turing machine to start in the standard position: 
(A7) VAL(1, 0) = L A V x# i M4L(ar, 0) = B A iZP(O) = 2 A ST(0) = si 
We want the Turing Machine to end in the final state and not be there before that: 
(A8) y teT (ST(t) = Sk ^t = u) 

We want values on the tape not to change except (possibly) at the header position: 
(A9) V t eT,t*y x ex(.x ^ HP(t) - VAL(x, t+) = VAL(x, t)) 

We want the rightmost spot on the tape to be used. (We need this for uniqueness 
of the model, we don't want to allow superfluous blanks.) 
(A10) 3 teT VAL(uj,t) ^ B 

We need that the instructions would not push the Turing Machine to the right of 
x — uj. For every Sj, a such that when at state and value a the instruction push 
the header to the right we have: 
(All) ^3 teT ( VAL(t) = a A ST(t) = s, t A HP{t) = lu) 

We are down to the core workings of the Turing Machine. For each instruction 
of the first type we have: 

(A12) V teT V xeX (ST(t) = Si A HP(t) = x A VAL(t) = a^ ST(t + ) = Sj A VAL{t + ) = 
(3 A HP(t + ) = x) 

For each instruction of the second type we have: 

(A13) V t eTVxex(ST(t) = Si A HP(t) = x A VAL(t) = a -> ST(t + ) = Sj A HP(t+) = 
x + A VAL(x,t + ) = a) 

For each instruction of the third type we have: 

(A14) y teT y x€X (ST(t) = Si A Hp\t) = x A VAL(t) = a^ ST(t+) = Sj A HP(t + ) = 
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x- A VAL(x,t + ) = a) 



4.2.3 Proof of the Simulation Lemma 

A straightforward inspection shows that gt(I?m) = 6, contributed, for example, by 
(A9). This gives Item 1 of the lemma. Since we treat a variable as a single symbol, 
(Al) and (A6) have length 0(k 2 ), (A11)-(A14) have length 0(k), and all the others 
have constant length. This gives Item 2. A straightforward inspection shows that 
alt(B M ) = 2, contributed by (A2). This gives Item 3. 

Item 4 requires a bit of extra work. As A M e A|, Lemma 2.3 implies that A M 
is reducible to an equivalent prenex formula with quantifier prefix 3*\/*3*V*. We 
make a stronger claim that one can achieve the prefix 3*^/0(1)30(1)^0(1) ^ Note that 
Bm has a constant number of conjunctive members with constant length and hence 
those contribute a constant number of quantifiers. (Al) and (A6), though have 
length dependent on k, contain a constant number of quantifiers. The remaining 
(All)-(A14) should be tackled with more care as every of these components, though 
has constant number of quantifiers, occurs in B M in 0(k) variants for various pairs 
Si, a. Fortunately, all these occurrences can be replaced by a single formula with 
constantly many quantifiers. For example, introducing two new variables s and c, 
we can replace the conjunction of all variants of (All) by 

-i3t 6T 3 a 3 c [ V (s = Si A c = a) A VAL(t) = c A ST(t) = s A HP if) = uj], 

Si,a 

where the disjunction is over the specified pairs Sj, a. 

Let us turn to Items 5 and 6. It should be clear that, if M halts, its computation 
is converted to a graph satisfying A M , whose order exceeds the running time. Such 
a graph is unique up to isomorphism because the adjacencies of any finite model of 
Am must mirror the actions of the Turing machine. By the same reason, any finite 
model of Am is converted into a halting computation of M and hence, if Am has 
a finite model, then M halts on the empty input. It remains to notice that, if M 
halts, then Am has no infinite model. Let m be the running time of M. In any 
model of A M , the first m values of t must simulate m steps of M's computation. 
By (A8), the set T is therefore finite. By (A10), the cardinality of X cannot exceed 
the cardinality of T and hence X is finite too. It immediately follows that the other 
components of the model, X', T', and Z, are finite as well. The proof is complete. 

5 Other consequences of the Simulation Lemma 

5.1 There are succinct definitions by prenex formulas 

Due to (1), any graph of order n is definable by a prenex formula of quantifier rank 
ra+l with alternation number 1. Though the class of prenex formulas may appear 
rather restrictive, it turns out that, if one allows to increase the alternation number 
to 3, then there are graphs definable by prenex formulas with very small quantifier 
rank. 
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Definition 5.1 Let L% renex (G) denote the minimum length of a closed prenex for- 
mula with alternation number at most a that defines a graph G. Furthermore, 

\G\=n 

Theorem 5.2 There is no general recursive function f such that f{s^ renex (n)) > n 
for all n. 

Proof. We proceed precisely as in the proof of Theorem 4.2 but using, instead 
of Am, the prenex formula Pm given by the Simulation Lemma. We will need a 
recursive bound \Pu k \ < l(k). We can take 1(h) = ck 2 A k owing to Lemma 2.4. ■ 

5.2 The set of defining sentences is undecidable 
Theorem 5.3 The class of defining sentences is undecidable. 

Proof. Given a Turing machine M, consider a sentence A M as in the Simulation 
Lemma. If M halts on the empty input, Am is defining. Suppose that M never 
halts. Then either Am has no model or it has an infinite model. By Lemma 2.6, 
Am is not defining in both cases. We therewith have reduced the halting problem 
(for the empty input) to the decision problem for the set of defining sentences. ■ 

Note a partial positive result given by Lemma 2.7. 

5.3 Dq(G) and D(G) are not recursively related 

Obviously, D(G) < Dq(G) for all graphs G. How far apart from each other can be 
these two values? Is there a converse relation D (G) < f(D(G)), for any general 
recursive function /? The answer is "no". We will actually prove a stronger fact. 
Let Di/ 2 (G) denote the minimum quantifier rank of a A^-sentence that defines G. 
Notice the hierarchy 

D{G) < D 3 (G) < D 2 (G) < D ± (G) < D 1/2 (G) < D (G). 

We are able to show a superrecursive gap even between D 3 (G) and D 1 / 2 (G). 

Theorem 5.4 There is no general recursive function f such that 

D 1/2 (G) < f(D 3 (G)) 

for all graphs G. 

Lemma 5.5 The finite satishability of a A^-sentence is decidable. 

Proof. By Lemma 2.3, a A^-sentence effectively reduces to an equivalent formula 
in the Bernays-Schonfinkel class. The finite satisfiability of the latter is decidable 
by the Ramsey theorem. ■ 
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The next lemma is related to the well-known fact that, over a finite vocabulary, 
there are only finitely many pairwise inequivalent sentences of bounded quantifier 
rank (cf. [2, lemma 4.4]). 

Lemma 5.6 Given m > 0, one can effectively construct a finite set U m consisting 
of Af-sentences of quantifier rank m so that every A^-sentence of quantifier rank m 
has an equivalent in U m . 

Proof. Any sentence A of quantifier rank m can be rewritten in an equivalent form 
A' so that A' uses at most m variables, where different occurrences of the same 
variable are not counted (see e.g. [19, proposition 2.3]). Referring to this fact, we 
will put in U m only sentences over the variable set {x±, . . . , x m }. We now prove the 
lemma in a stronger form saying that, for each m and k such that < k < m, 
one can construct a finite set U m ^ which is universal for the class of A^-formulas of 
quantifier rank k over the variable set {x±, . . . , x m } with precisely k variables bound. 

We proceed by induction on k. Consider the base case of k — 0. There are 
a = 2^™) atomic formulas Xi ~ Xj and Xi = Xj. Any quantifier-free formula is a 
Boolean combination of these and can be represented by a perfect DNF (except the 
totally false formula for which we fix representation x\ — x\ A x\ ^ x\). The set 
U m fi consists of all 2 2 " such expressions. 

U m ^ will consist of two parts, U^ k and U^ k , the former for formulas with at least 
one existential quantifier and the latter for formulas with no existential quantifier. 
If k — 0, we have L7^ = and C/^ = U m $. Assume that k > 1 and U m ^-i has 
been already constructed. We construct U m ^ in four steps. 

1) Put in U^ k the formulas 3xiA for all A e U m ^i and i < m such that no 
occurrence of X; L in A is bound. 

2) Put in U^ k the formulas VxiA for all A G U^k-i an d i < rn such that no 
occurrence of X; L in A is bound. 

3) Put in U^ k all monotone Boolean combinations of formulas from U^ k and 
U^ k as constructed in Steps 1 and 2 with at least one formula from U^ k 
involved. 

4) Put in U^ nk all monotone Boolean combinations of formulas from U^ nk as 
constructed in Step 2. 

Finally, to obtain U m exactly as claimed in the lemma, we set U m = U m ^ m . ■ 

Proof of Theorem 5.4. Suppose on the contrary that a such / exists. Using the /, 
we will design an algorithm for the halting problem, contradicting the unsolvability 
of the latter. 

Given a Turing machine M, we construct the sentence Am as in the Simulation 
Lemma. Recall that 

• alt(A M ) = 3; 
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• if M halts on the empty input, then A M defines a finite graph G M ; 

• if M does not halt, then Am has no finite model. 

Denote k = qr(AM) and m = maxj<fc/(i). Thus, if Gm exists, then _D 3 (Gm) < k 
and, by the assumption, .D1/2 (Gm) < 

Construct U m as in Lemma 5.6 and add to every sentence in U m the two graph 
axioms. We know that U m contains a sentence defining Gm and this will help us 
to construct this graph (if it exists). Remove from U m all finitely unsatisfiable 
formulas. This task is tractable by Lemma 5.5. For every remaining sentence, by 
brute-force search we eventually find a finite graph satisfying it (we need one model 
for every sentence and do not care that some sentences may have other models). Let 
Gi, . . . , G\ be the list of these graphs. 

If M halts, one of the GVs coincides with Gm and satisfies A M - If M does not, 
none of the GVs satisfies A M - Thus, the verification if A M is true on one of the GVs 
allows us to recognize if M halts on the empty input. ■ 

Corollary 5.7 

1) There is no general recursive function f such that D (G) < f(D(G)) for all 
graphs G. 

2) There is no general recursive function f such that D (G,G') < f(D(G,G')) 
for all non- isomorphic G and G' . 

Proof. 1) Suppose on the contrary that a such / exists. Then we would have 
Di/2(G) < Dq(G) < f(D(G)) < maxj<£> 3 (G) f(i), contradictory to Theorem 5.4. 

2) Again, suppose that a such / exists. By Proposition 3.6, D (G) = D (G,G') 
for some G'. It follows that D (G) < f(D(G,G')) < maxj< D ((?) /(«), contradictory 
to Item 1. ■ 

It is also worthy to note the following fact. 

Theorem 5.8 D (G) and Di/ 2 (G) are computable functions of graphs. 

Proof. We prove the theorem for Di/ 2 (G); For D (G) the proof is similar. Starting 
from m = 2, we trace through the universal set U m given by Lemma 5.6 and, for 
each sentence A e U m , check whether G satisfies A and, if so, whether A is defining. 
The latter can be done on the account of Lemma 2.7. If no such A is found, we 
conclude that Dy 2 (G) > m and increase mby 1. ■ 

Remark 5.9 A variant of Theorem 5.4 for the formula length is also true, even 
with a simpler proof (no reference to Lemma 5.6 is needed). 
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5.4 An undecidable fragment of the theory of finite graphs 

Given a class of cr-structures C, let Sat{C) (resp. Sat = {C)) be the set of formulas 
over a without equality (resp. with equality) that have a model in C. Furthermore, 
let Satfi n (C) (resp. SatJJC)) be the set of formulas over a without equality (resp. 
with equality) that have a finite model in C. If X is one of the aforementioned sets 
and F is a class of formulas over a, we call the intersection F n X the F -fragment 
of X. We will be interested in the case that F is a prefix class, that is, consists 
of prenex formulas whose quantifier prefix agrees with a given pattern. Describing 
such a pattern, we use V* or 3* to denote a string of all V or all 3 of any length. 

Let V (resp. X and S) denote the class of structures consisting of a single binary 
relation (resp. irreflexive binary relation and symmetric binary relation). In other 
words, T> is the class of directed graphs. By Q we denote the class of graphs, i.e., 
structures consisting of a single irreflexive symmetric relation. 

Based on Church's and Turing's solution of Hilbert's Entscheidungsproblem, 
Kalmar [11] proved that Sat(T>) is undecidable. Following the Kalmar result and 
the Trakhtenbrot theorem [25], Vaught [26] proved that the set Satfi n (T>) and the 
set of formulas not in Sat(T>) are recursively inseparable, that is, no decidable set 
contains the former and is disjoint with the latter. In particular, both Satfi n (V) and 
Sat(V) are undecidable. Currently a complete classification of prefix fragments of 
Sat(V), Satfi n (V), Sat = (V), and Satj n (V) is known (see [1], a reference book on 
the subject). 

Gurevich [9] shows that the V 3 3*-fragment of SatiX) is undecidable while the 
V 2 3*-fragment is decidable (cf. [1, corollary 6.2.35]). Church and Quine [3] estab- 
lished the undecidability of Sat(S). Note that this result is easily extended to 
Sat = {Q). The undecidability of Sat{Q) was proved by Rogers [23]. Lavrov [14] 
(see also [6, theorem 3.3.3]) improved this by showing the recursive inseparability of 
Satfi n (Q) and the set of formulas not in Sat(Q). 

Lavrov's proof provides us with a reduction of the decision problem for V to the 
decision problem for Q. If combined with the known results on undecidable fragments 
of Satfi n (V), this gives us some undecidable fragments of Satfi n (Q), for example, 
V 9 3*V*3*. However, this method apparently cannot give undecidable fragments with 
less than two star symbols. Our Simulation Lemma has relevance to this circle of 
questions. 

Theorem 5.10 For some I, m, and n, the 3*\/ l 3 m \/ n -fragment of Sat^ n {Q) is unde- 
cidable. 

Proof. By the Simulation Lemma, a Turing machine M halts on the empty input 
iff the formula A M has a finite graph as a model. Thus, the conversion of A M to a 
prenex formula according to Item 4 of the Simulation Lemma reduces this variant of 
the halting problem to the satisfiability problem for 3*V'3 m V n -formulas over finite 
graphs. ■ 

The theorem should be contrasted with the decidability of the 3*V*-fragment, 
which follows from the Ramsey theorem and the fact that the class of graphs is 
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definable by a 3*V*-formula. We do not try to specify numbers l,m,n since the 
values derivable from our proof are, though not so big, surely improvable by extra 
technical efforts. Note that a variant of the theorem for Satfi n (T>) is known to be 
true with best possible I — m — n — 1 (see [1, theorem 3.3.2], which is Suranyi's 
theorem extended to the finite satisfiability by Gurevich). 

Note another equivalent form of Theorem 5.10. Let ThJ n (Q) denote the first 
order theory of finite graphs with equality, i.e., the set of first order sentences with 
relation symbols ~ and = that are true on all finite graphs. Observe that a sentence 
A is in Thfl n (G) iff ->A is not in SatJ n {Q). It follows that the V*3V m 3"-fragment of 
Thfi n (G) is undecidable. 

6 The succinctness function over trees: Upper 
bound 

We define a variant of the succinctness function for a class of graphs C (with respect 
to the quantifier rank) by 

q(n; C) = min { D{G) : G G C, \G\ = n} . 

We here prove a log-star upper bound for the class of trees. 

Theorem 6.1 q(n; trees) < \og* n + 5. 

The proof takes the rest of this section. 

6.1 Rooted trees 

A rooted tree is a tree with one distinguished vertex, which is called the root. If T 
is a tree and v G V(T), then T v denotes the tree T rooted at v. An isomorphism of 
rooted trees should not only preserve the adjacency relation but also map one root 
to the other. Thus, for distinct u, v G V(T), rooted trees T u and T v , though have 
the same underlying tree T, may be non- isomorphic. 

An automorphism of a rooted tree is an isomorphism from it onto itself. Obvi- 
ously, any automorphism leaves the root fixed. We call a rooted tree asymmetric if 
it has no non-trivial automorphisms, that is, no automorphisms except the identity. 

The depth of a rooted tree T v , which is denoted by clepthT v , is the eccentricity 
of its root. If (v, . . . , u, w) is a path in T v , then w is called a child of u. We define 
the relation of being a descendant to be the transitive and reflexive closure of the 
relation of being a child. 

If w G V(T V ), then T v (w) denotes the subtree of T v spanned by the set of all 
descendants of w and rooted at w. If w is a child of u G V(T V ), then T v (w) is called 
a u-branch of T v . 



24 



6.2 Diverging trees 

We call T v diverging if, for every vertex u G V(T V ), all -u-branches of T v are pairwise 
non-isomorphic. 

Lemma 6.2 A rooted tree T v is diverging iff its v-branches are pairwise non- 
isomorphic and each of them is diverging. 

Proof. Assume that T v is diverging. Its f-branches are pairwise non-isomorphic 
by the definition. Furthermore, let T v (w) be a f-branch of T v and u G V(T v (w)). 
Note that any -u-branch of T v (w) is also a -u-branch of T v . Therefore, all of them are 
pairwise non-isomorphic and T v (w) is diverging. 

For the other direction, consider a non-root vertex u of T v and let T v (w) be the 
■u-branch of T v containing u (w — u is possible). Note that any it-branch of T v is 
also a -u-branch of T v (w). Therefore, all of them are pairwise non-isomorphic and 
we conclude that T v is diverging. ■ 

Lemma 6.3 A rooted tree T v is diverging iff it is asymmetric. 

Proof. We proceed by induction on d — depthT v . The base case of d — is trivial. 
Let d > 1. 

Assume that T v is diverging. By Lemma 6.2, no automorphism of T v can map one 
-y-branch onto another w-branch. By the same lemma and the induction assumption, 
no non-trivial automorphism can map a -y-branch onto itself. Thus, T v has no non- 
trivial automorphism. 

Assume now that T v is asymmetric. Hence all f-branches are pairwise non- 
isomorphic and each of them is asymmetric. By the induction assumption, each 
f-branch is diverging. By Lemma 6.2 we conclude that T v is diverging. ■ 

We now carry over the notion of a diverging tree to (unrooted) trees. Clearly, any 
automorphism of a tree T either leaves central vertices c\ and c 2 fixed or transposes 
them [c\ = c 2 if the diameter d(T) is even). If d(T) is odd, Lemma 6.3 implies that 
T C1 and T C2 are simultaneously diverging or not. This makes the following definition 
correct: A tree T is diverging if the rooted tree T c for a central vertex c is diverging. 
It is not hard to see that T is diverging iff one of the following conditions is met: 

1) T has no non-trivial automorphism. 

2) T has exactly one non-trivial automorphism and this automorphism transposes 
two central vertices of T. 

6.3 Spoiler's strategy 

In this section we exploit the characterization of the quantifier rank of a distinguish- 
ing formula as the length of the Ehrenfeucht game (see Proposition 3.5). 
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Lemma 6.4 Suppose that in the Ehrenfeucht game on (G, G') some two vertices 
1,1/6 V(G) at distance k were selected so that their counterparts x' ', y' G V(G') are 
at a strictly larger distance (possibly infinity). 

Then Spoiler can win in at most [log k] extra moves, playing all the time inside G. 

Proof. Spoiler sets u\ = x, u 2 = y, v\ = x', v 2 = y', and places a pebble on the 
middle vertex u in a shortest path from u\ to u 2 (or either of the two middle vertices 
if d(u\,u 2 ) is odd). Let v G V{G') be selected by Duplicator in response to u. By 
the triangle inequality, we have d(u, u m ) < d(v, v m ) for m = 1 or m = 2. For such 
m Spoiler resets U\ = u, u 2 = u m , V\ — v, v 2 — v m and applies the same strategy 
once again. Therewith Spoiler ensures that, in each round, d(u 1 ,u 2 ) < d(vi,v 2 ). 
Eventually, unless Duplicator loses earlier, d(ui,U2) = 1 while d(vi,v 2 ) > 1, that is, 
Duplicator fails to preserve adjacency. 

To estimate the number of moves made, notice that initially d(ui,u 2 ) = k and 
for each subsequent u±, u 2 this distance becomes at most f(d(ui,u 2 )), where f(a) = 
(a + l)/2. Therefore the number of moves does not exceed the minimum % such 
that f (i) (k) < 2. As = 2*/3 - 2* + 1, the latter inequality is equivalent to 

2* > k, which proves the bound. ■ 

Note that the bound of Lemma 6.4 is tight, more precisely, it cannot be improved 
to [log A;] — 1. For example, let C n denote a cycle of length n and 2C n the disjoint 
union of two such cycles. It is known (e.g. [24, Proof of Theorem 2.4.2] or [4, Example 
2.3.8]) that Duplicator can survive in the Ehrenfeucht game on C 2 k+\ and C 2 k+ 2 in 
more than log k + 1 rounds for any strategy of Spoiler, in particular, when Spoiler 
begins with selecting two antipodal vertices in C 2 k+ 2 . Furthermore, if d(x',y') = oo, 
Duplicator can be persistent as well. For example, she can survive in the game on 
C 2 k and 2C 2 k during [\og(2k — 1)J rounds for any strategy of Spoiler, in particular, 
when Spoiler's first move is in one component of 2C 2 k and his second move is in the 
other component of 2C 2 k (e.g. [4, Example 2.3.8]). 

Lemma 6.5 If graphs G and G' have different diameters (including the case that 
G is connected and G' is disconnected), then Di(G,G') < [logrf(G)] + 2. 

Proof. Assume that d(G) < d(G'). Spoiler begins with selecting two vertices at 
distance d(G) + 1 in G", then jumps to G, and uses the strategy of Lemma 6.4. ■ 

Lemma 6.6 If G is a tree, G' is a connected non-tree, and d(G) = d(G'), then 
D (G,G')< [logd(G)l+4. 

Proof. Denote k = d(G) = d(G'). Let C be a shortest cycle in G'. Notice that 
C has length at most 2k + 1. Spoiler begins with selecting in C a vertex z' along 
with its neighbors x' and y' . Let z, x, and y be the corresponding responses of 
Duplicator in G. The vertex z cannot be a leaf of G for else Duplicator has lost. 
From now on Spoiler plays all the time in H' = G' — z' and Duplicator is enforced 
to play in H = G — z. In these graphs d(x', y') <2k — l and d(x, y) = oo. Therefore 
the strategy of Lemma 6.4 applies and Spoiler wins in at most \\og(2k — 1)] extra 
moves. ■ 
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Lemma 6.7 Let T and V he two non-isomorphic diverging trees with d(T) = d(T') 
(and hence r(T) = r(T') ). Then D(T, T) < r(T) + 1. 

Proof. In the first move Spoiler selects x, a central vertex of T. Duplicator's 
response, x', should be a central vertex of T' because otherwise Spoiler selects a 
vertex y' in T' with d(x' ', y') > r(T) and applies the strategy of Lemma 6.4. We will 
denote the vertices selected by the players in T and T' during the i-th round by Xi 
and x\\ In particular, x\ — x and x[ = x' . Spoiler will play so that (x\, . . . , Xi) and 
(x[, ■ ■ ■ jx'j) are always paths. Another condition that will be obeyed by Spoiler is 
that T x (xi) and T' x ,{x'^ are non-isomorphic. 

Assume that the i-th. round has been played. If exactly one of the vertices X; L 
and x\ is a leaf (we will call a such situation terminal), then Spoiler prolongs that 
path for which this is possible and wins. Assume that neither of Xi and x a is a 
leaf and that T x (xi) and T' x ,{x'^ are non-isomorphic (in particular, this is so for 
% — 1). By the definition of a diverging rooted tree, all T x (u) with u a child of Xi 
are pairwise non-isomorphic. The same concerns all T' x ,(u') with u' a child of x\. 
It follows that there is a T x (u) not isomorphic to any of the T^,(-u')'s or there is a 
T' x ,{u') not isomorphic to any of the T x (u)'s. Spoiler selects such u for Xi + \ or u' 
for x' i+1 . Clearly, Spoiler has an appropriate move until a terminal situation occurs. 
The latter occurs in the r(T)-th round at latest. ■ 

Lemma 6.8 Let T and V be two trees with d(T) = d{T') (and hence r(T) = r(T') ). 
Suppose that T is diverging hut V is not. Then D(T, T) < r(T) + 2. 

Proof. In the first move Spoiler selects x', a central vertex of T' . Similarly to the 
preceding proof, we may suppose that Duplicator's response x is a central vertex of 
T. Let y' be a vertex of T' such that T' x ,{y') is not diverging but, for any child z' of 
y', T' x ,(z') is. Note that y' must have two children z[ and z' 2 such that T x ,(z[) and 
T x ,(z' 2 ) are isomorphic. 

In subsequent moves Spoiler selects the path P' = (x' : . . . , y', z[). Let P = 
(x, . . . , y, z) be Duplicator's response in T. If T x (z) and T x t[z'^) have different depths 
d and d', say d > d', then Spoiler prolongs P with d' + 1 new vertices and wins. It 
is clear that the prolonged path has at most r(T) + 1 vertices. 

Suppose now that d = d' . If T x (z) and T x i(z[) are non-isomorphic, then Spoiler 
adopts the strategy of Lemma 6.7 and wins having made totally at most r(T) + 1 
moves. If T x (z) and T x ,(z[) are isomorphic, then Spoiler selects z' 2 . In response 
Duplicator must select a child of y different from z. Denote it by z*. The subtree 
T x (z*) is non-isomorphic to T x (z) and hence to T' x ,(z' 2 ). Now Spoiler is able to 
proceed with T x (z*) and T x ,(z' 2 ) as it was described and wins having made totally 
at most r(T) + 2 moves (one extra move was made to switch from z[ to z' 2 ). ■ 

Lemma 6.9 Let T be a diverging tree of radius at least 6. Then D(T) < r(T) + 2. 

Proof. Let T' be a graph non-isomorphic to T. The pair T, T' satisfies the condition 
of one of Lemmas 6.5-6.8. These lemmas provide us with bound D(T, T') < r(T) +2. 
By Proposition 3.6, we therewith have the bound for D(T). ■ 
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We have shown that diverging trees are definable with quantifier rank no much 
larger than the radius. It remains to show that, given the radius, there are diverging 
trees with large order and, moreover, the orders of these large trees fill long segments 
of integers. 

Lemma 6.10 Given % > 0, let Mi denote the total number of (pairwise non- 
isomorphic) diverging rooted trees of depth at most i. Then Mi = T(i). 

Proof. Let m 8 denote the number of diverging rooted trees of depth precisely i. 
Thus, m Q = 1 and Mj = m + . . . + m^. By Lemma 6.2, a depth- (« + 1) tree T v is 
uniquely determined by the set of its t>-branches, which are diverging rooted trees 
of depth at most i. Vise versa, any set of diverging rooted trees of depth at most 
% with at least one tree of depth precisely i, determines a depth- (i + 1) tree. It 
follows that rrii + i = (2 mi — l)2 Mi ~ 1 , where we put M_i = 0. By induction, we obtain 
mi = T(i) -T(i- I) and Mi = T(i). m 

Note that a diverging rooted tree of depth % can have the minimum possible 
number of vertices i + 1 (a path). 

Lemma 6.11 Let iVj denote the maximum order of a diverging rooted tree of depth 
i. Then N { > T(i - 1). 

Proof. The largest diverging rooted tree T v of depth % has every of Mj_! diverging 
rooted trees of depth at most i — 1 as a f-branch. Thus, iVj > Mj_! = T(i — 1). ■ 

Lemma 6.12 For every n such that % + 1 < n < iVj there is a diverging rooted tree 
of depth % and order n. 

Proof. We proceed by induction on i. The base case of % = is trivial. Let % > 1. 
For n = i + 1 we are done with a path. We will prove that any diverging rooted tree 
T v of depth % except the path can be modified so that it remains a diverging rooted 
tree of the same depth but the order becomes 1 smaller. 

Let / be the smallest depth of a w-branch of T v and fix a branch T v (w) of this 
depth with minimal order. If T v (w) is a path, we delete its leaf. If not, we reduce it 
by the induction assumption. ■ 

Lemma 6.13 Let i > 2. For every n such that 2i + 2 < n < 2N i} there is a 
diverging tree of order n and radius i + 1. 

Proof. If n = 2m is even, consider the diverging rooted tree T c with two c-branches, 
one of order m, the other of order m — 1, and both of depth % (excepting the case 
that n = 2% + 2 when the smaller branch has depth % — 1). Such branches do exists 
by Lemma 6.12. If n = 2m+ 1 is odd, we add the third single-vertex c-branch. Since 
the root c is a central vertex of the underlying tree, the latter is diverging. ■ 
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Proof of Theorem 6.1. Let n > 32 = 2T(3) and let i > 3 be such that 2T(i) < 
ra < 2T(i + 1). By Lemma 6.11, we have 2% + 6 < n < 2N i+2 . Owing to Lemma 
6.13, there exists a diverging tree T of order n and radius i + 3. Lemma 6.9 gives 
-D(T) < i + 5 < log*n + 5. 

For every n < 32 the required bound is provided by P n , the path on n vertices. 
It is not hard to derive from Lemma 6.5 that Di(P n ) < logn + 3 for all n, which 
satisfies our needs for n in the range. ■ 



7 The succinctness function over trees: Zero al- 
ternations 

Theorem 6.1 assumes no restriction on the alternation number. We now prove a 
weaker analog of this theorem for qo(n; trees) = mm\T\= n D (T) , the succinctness 
function over trees with the strongest restriction on the alternation number. This 
is somewhat surprising in view of Corollary 5.7 (1) asserting that Dq(G) and D(G) 
may be very far apart from one another. 

Theorem 7.1 For infinitely many n we have qo(n; trees) < 2 log* n + 0(1). 
The proof takes the rest of the section. 

7.1 Ranked trees 

We will modify the approach worked out in the preceding section. The proof of 
Theorem 6.1 was based on Lemmas 6.5-6.8. Note that the alternation number 
in Lemma 6.6 is 0. In Lemma 6.5 it is 1, but the bound of this lemma is actually 
stronger than we need and, at the cost of some relaxation, we will be able to improve 
the alternation number to (see Lemmas 7.6 and 7.8 below). The real source of 
non-constant alternation number is Lemma 6.7 (Lemma 6.8 reduces to Lemma 6.7 
and itself makes no new complication). To tackle the problem, we restrict the class 
of diverging trees so that we will still have relation D (T) = 0(r(T)) and there will 
still exist trees with Tower (r(T) — 0(1)) vertices. 

We begin with introducing some notions and notation concerning rooted trees. 
Given a rooted tree T v , let B(T V ) denote the set of all f-branches of T v . Given 
rooted trees T 1; . . . , T m , we define T — T\ • • • T rn to be the rooted tree with 
B(T) = {Ti, . . . ,T m }. By Lemma 6.2, if all T« are pairwise non-isomorphic and 
diverging, then T is diverging as well. Obviously, depthT = 1 + maxj depthTi. 

Let T' v , and T v be rooted trees. We call T' v , a rooted subtree of T v if v' — v and 
V(T) C V(T). 

For each % > 0, we now define the class of rooted trees R* as follows. Let 
Rq = {T*, T 2 * , T 3 *, T4 }, the set of four rooted trees depicted in Figure 1. Observe the 
following properties of this set. 

(Zl) \T*\ < 8 for all i. 
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Figure 1: Rq. 



(Z2) depthT* = 4 



for all %. 



(Z3) All T* are diverging. 



(Z4) No T* is isomorphic to a rooted subtree of any other T*. 



Assume that R*_i is already specified. We will need a large enough Fj C 2 a 
family of subsets of R*_i which is an antichain with respect to the inclusion (i.e. no 
member of Fi is included in any other member of Fj). As one of suitable possibilities 
(which actually maximizes \Fi\ by Sperner's theorem), we fix 



Note that \R*\ = |F;|. 

It is clear that, if T G R* : then B(T) consists of pairwise non- isomorphic rooted 
trees in R*_ v By easy induction, we have the following properties of the class R* 
for i > 1. 

(Rl) If T G then r(T) = deptiiT = i + 4. 
(R2) If T G then d(T) = 2i + 8. 

(R3) If T G i?*, then the central vertex of T is equal to the root. 
(R4) All T G i?* are diverging. 

(R5) If T and T' are different members of R*, then neither -B(T) C B(T') nor 
S(T') C B{T). 




the family of all |_j J R*_ 1 |/2 J -element subsets of R*_ ± . Now 
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We define Ri to be the set of underlying trees of rooted trees in R*. Note that 
for different T, T' G R* their underlying trees are non-isomorphic. If % — 0, this is 
evident. If % > 1, we use the fact that, as any isomorphism between the unrooted 
trees takes one central vertex to the other, it is also an isomorphism between the 
rooted trees. Note also that trees in Ri are diverging. 

We will call trees in R = U~i Ri ranked. If T e Ri, we will say that T has rank 
i and write rkT = i. 

Lemma 7.2 Let iVj denote the minimum order of a tree of rank i. Then iVj > 



7.2 Spoiler's strategy 

Consider the Ehrenfeucht game on rooted trees (T„, 7^/). Let Xj denote the vertex of 
T v selected in the i-th round. We call a strategy for Spoiler continuous if he plays all 
the time in T v and, for each i, the induced subgraph T[{v , x\, . . . , Xi}} is connected. 

Lemma 7.3 Let T v and T' v , be non-isomorphic rooted trees in R*. Then Spoiler 
has a continuous winning strategy in EHR i+7 (T„, 7^,) and hence D (T V , T^,) < i + 7. 

Proof. We proceed by induction on i. In the base case of % — 0, Spoiler selects all 
non-root vertices of T v in a continuous manner and wins by Property (Z4). Let % > 1. 
In the first move Spoiler selects w, a child of v such that, for any w', child of v', 
branches T v (w) and T^iw') are not isomorphic. This is possible owing to Property 
(R5). Let w' denote Duplicator's response. Both T v (w) and T^(w') have rank i — 
Spoiler now invokes a continuous strategy winning EURi +e (T v (w) , T^(w')), which 
exists by the induction assumption. ■ 

Lemma 7.4 Let T, T' be trees of the same even diameter and v, v' be their central 
vertices. Assume that Spoiler selects v but Duplicator responds with a vertex dif- 
ferent from v'. Then Spoiler is able to win in the next d(T) moves, playing all the 
time in T. 

Proof. In a continuous manner, Spoiler selects the vertices of a diametral path in 
T. Let u 7^ v ' be the vertex selected by Duplicator in response to v. Duplicator 
should now to exhibit a path of length d(T') = d(T) with u at the middle, which is 
impossible by Proposition 2.1. ■ 



T(i-0(1)). 



Proof. Denote = \Ri\. By the construction, we have 




and 




The lemma follows by simple estimation. 
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Lemma 7.5 Let T and T' be non-isomorphic ranked trees of the same rank. Then 
D (T,T') < 2 rkT + 9. 

Proof. Let v and v' be central vertices of T and T 1 respectively. Spoiler starts 
by selecting v. If Duplicator responds not with v', Spoiler applies the strategy of 
Lemma 7.4 and wins in the next d(T) moves. If Duplicator responds with v', Spoiler 
applies the strategy of Lemma 7.3 and wins in the next rkT + 7 moves. In any case 
Spoiler wins in 1 + max{d(T) , rk T + 7} = 2 rk T + 9 moves. ■ 

Lemma 7.6 Let T be a ranked tree and G be either a tree of different diameter or 
a connected non-tree. Then D (T, G) <2rkT + 10. 

Proof. If G is a tree, then d(T) + 2 moves are enough for Spoiler to win. In this 
case, he selects a path of length min{<i(T), d(G)} + 1 in the graph of larger diameter. 

Suppose that G is not a connected non-tree. If G has a cycle on at most d(T) + 2 
vertices, Spoiler selects it and wins. Otherwise G must have a cycle on at least 
d(T) + 3 vertices. Spoiler wins by selecting a path on d(T) + 2 vertices of this 
cycle. ■ 

Lemma 7.7 Let T be a ranked tree and G be a non-ranked tree. If d(T) = d(G), 
then D Q (T,G) < 2 rkT + 9. 

Proof. Let v and c denote the central vertices of T and G respectively. The tree 
in which Spoiler plays will be specified below. In the first move Spoiler selects the 
central vertex of this tree. If Duplicator responds not with the central vertex of 
the other tree, he loses in the next d(T) moves by Lemma 7.4. Assume that she 
responds with the central vertex. Further play depends on which of three categories 
G belongs to. Let k = rkT. For any w G V(G) at distance k from c, we will call 
G c (w) an apex of G c . 

Case 1: G c has an apex G c (w) which is not a rooted subtree of any of the four 
rooted trees in R$. Spoiler plays in G. In the next k moves he selects the path from 
c to w. Duplicator is enforced to select the path from v to a vertex u such that 
T v (u) G Rq. Spoiler is now able to win by selecting at most 8 vertices of G c (w). The 
total number of moves does not exceed l + /c + 8 = /c + 9. 

Case 2: G has a vertex w such that B(G c (w)) properly contains B(H W ) for some 
H w G R* , where i = k — d(c,w). Spoiler plays in G. In the next d(c,w) moves 
he selects the path from c to w. Let u denote the vertex selected by Duplicator 
in response to w and F u = T v (u). Clearly, Duplicator must ensure the equality 
d(v,u) = d(c,w) and hence F u G R*. 

If F u and H w are not isomorphic, then Spoiler restricts further play to H w follow- 
ing a continuous strategy. Of course, Duplicator is enforced to play in F u . Spoiler 
is able to win in the next % + 7 moves according to Lemma 7.3. 

Suppose now that F u and H w are isomorphic. In the next move Spoiler selects 
a child of w which is not in H w . Duplicator must respond with a child of u in 
F u . Denote it by x and let y be the vertex of H w corresponding to x under the 
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isomorphism from F u to H w . Recall that, by Lemma 6.3, diverging trees are asym- 
metric and therefore such isomorphism is unique. In the next move Spoiler selects 
y. Duplicator must respond with z, another child of u in F u . Note that F u (z) and 
H w (y) are not isomorphic since the latter is isomorphic to F u (x) but the former is 
not. From now on Spoiler restricts play to F u (z) and H w {y) using the strategy of 
Lemma 7.3, and wins in the next i + 6 moves. The total number of moves is at most 
1 + d(c,w) +i + 8 = k + 9. 

Case 3: Neither 1 nor 2. Spoiler plays all the time in T. We will denote 
the vertices selected by him in the next k moves by xi, . . . ,Xk subsequently. Let 
yi, . . . ,yk denote the corresponding vertices selected in G by Duplicator. Put also 
xo — v and yo = c. Spoiler will play so that x ,xi, . . . ,Xk will be a path. Let 
1 < i < k. Suppose that the preceding x , . . . , are already selected. Assume 
that T v (xi-i) and G c {yi-\) are non-isomorphic (note that this is so for % = 1). As we 
are not in Case 2, has a child x such that T v (x) B(G c (yi-i)). Spoiler takes this 
x for Xi thereby ensuring that T v (xi) and G c (yi) are non-isomorphic again, whatever 
yi is selected by Duplicator. The final stage of the game goes on non-isomorphic 
T v (xk) and G c (y k ). Spoiler selects all vertices of T v (x k ). 

Note that T v (xk) G Rq and G c {y k ) is an apex of G. As we are not in Case 1, 
G c (yk) is a rooted subtree of some T* G Rq. If T* = T v (xk), G c (yk) must be a proper 
subtree of T v (xk) and hence Spoiler has won. Otherwise, note that T v (xk) cannot 
be a rooted subtree of G c (yt) by Property (Z4). Again, this is Spoiler's win. The 
total number of moves equals l + k + 7 — k + 8. 

In any of the three cases Spoiler wins in max{l + d(T), k + 9} = 2k + 9 moves. ■ 

Note that, if T is a ranked tree of rank k, then Lemmas 7.5-7.7 provide Spoiler 
with a winning strategy in the O-alternation EHR 2 fc+io(^, G) whenever G is a con- 
nected graph non-isomorphic to T. 

Lemma 7.8 Let T he a ranked tree and H he a disconnected graph. Then D (T, H) 
< 2rkT + 10. 

Proof. We distinguish two cases. 

Case 1: No component of H is isomorphic to T. 

Subcase 1.1: H has a component G such that Spoiler is able to win EHR 2 fc+io(^, 
G) playing all the time in G. Spoiler plays exactly this game. 

Subcase 1.2: H has no such component. In the first move Spoiler selects the 
central vertex of T. Suppose that Duplicator's response is in a component G of 
H . By Lemmas 7.5-7.7, we are either in the situation of Lemma 7.6 (with G a 
tree of diameter d(G) < d(T)) or in the situation of Lemma 7.7 (namely, in Case 
3). In both situations Spoiler has a continuous winning strategy for Ehr 2 £. +10 (T, G) 
allowing him to play all the time in T starting from the central vertex. Spoiler 
applies it and wins as Duplicator is enforced to stay in G. 

Case 2: H has a component T' isomorphic to T . Spoiler plays in H. His first 
move is outside T' . Let x G V(T) be Duplicator's response. Let x' be the counterpart 
of x in T' (recall that ranked trees are asymmetric and hence x' is determined 
uniquely). Denote the central vertices of T and T' by v and v' respectively. In the 
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second move Spoiler selects v'. If Duplicator responds not with v, Spoiler applies the 
strategy of Lemma 7.4 and wins in the next d(T) moves. Assume that Duplicator 
responds with v. Starting from the third move, Spoiler selects the vertices on the 
path between v' and x', one by one, starting from a child of v ' . If Duplicator follows 
the path from v to x, she loses as x is already selected. Assume that Duplicator 
deviates at some point, selecting a vertex y not on the path, and let y' be the vertex 
on the path between v' and x' selected in this round by Spoiler. Note that the 
rooted subtrees T v (y) and T^(y') are non-isomorphic. Spoiler therefore can apply 
the continuous strategy of Lemma 7.3 and win in the next % + 7 moves, where 
i = k — d(v, y). The total number of moves is at most l + max{l + d(T), l + d(x, y) + 
(i + 7)} = 2k + 10. - 

Lemma 7.8 completes our analysis: If T is a ranked tree of rank k and G is an 
arbitrary graph non-isomorphic to T, then we have a winning strategy for Spoiler 
in the 0-alternation EHR 2 fc + io(7\ G). By Proposition 3.6, we conclude that D (T) < 
2rkT+ 10. 

To complete the proof of Theorem 7.1, let Tj be a tree of rank % and order Ni 
as in Lemma 7.2. We have q (Ni, trees) < D (TJ < 2% + 10 < 2 log* Ni + 0(1), the 
latter inequality due to Lemma 7.2. 

8 The succinctness function over trees: Lower 
bound 

Complementing the upper bound given by Theorem 6.1 we now prove a nearly tight 
lower bound on q(n; trees). 

Theorem 8.1 q(n; trees) > log* n — log* log* n — 0(1). 

It will be helpful to work with rooted trees. The first order language for this class 
of structures has a constant R for the root and the parent-child relation P(x, y). Let 
T v and T' u be rooted trees and suppose that T v T' u . By Proposition 3.5, T v and 
T ! u satisfy the same sentences of quantifier rank k. Then T T' for the underlying 
trees. Indeed, take any sentence in the language for trees and replace the adjacency 
x ~ y with P(x,y) V P(y,x). We get a sentence with the same truth value in the 
language of rooted trees. 

Let g(k) be the number of ^-equivalence classes of rooted trees. Similarly to 
Lemma 3.3, we have g(k) <T(k + 2 + log* k) + 0(1). Set 

g(k)-i 

U(k) = £ (kg(k))\ 

i=0 

Lemma 8.2 Let T v be a finite rooted tree. Then, for any k > 1, there exists a 
finite rooted tree T' u with at most U(k) vertices such that T v = k T' u . 
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Proof of Theorem 8.1. Consider an arbitrary tree T of order n and let k = D(T). 
Rooting it at an arbitrary vertex v, consider a rooted tree T v . Let T' u be as in Lemma 
8.2. Thus, we have T = k V and \T'\ < U(k). By the choice of k, T and V must be 
isomorphic. We therefore have 

n < U{k) < {kg{k)) 9(k) < T{k + log* k + 4) + 0(1), 

which implies k > log* n — log* log* n — 0(1). ■ 
Lemma 8.2 follows from a series of lemmas. 

Lemma 8.3 Let T v he a rooted tree and w a non-root vertex ofT v . Suppose that 
T' w = k T v (w). Let T' v he the result of replacing T v (w) by T' w . Then T v = k T' v . 

Proof. Duplicator wins the Ehrenfeucht game on T v , T^ by playing it on T v (w), T' w 
(since the root is a constant symbol she automatically plays root for root) and the 
identical vertices elsewhere. ■ 



Lemma 8.4 Let T v be a rooted tree with wi, . . . , w s the children of the root v, and 
ai,...,a s the k-Ehrenfeucht values of the trees T v (wi). Then the k-Ehrenfeucht 
value of T is determined by the oii 's. 

Proof. If T v and T' u have the same ai, . . . , a s we reach T' u from T v in s applications 
of Lemma 8.3. ■ 



Lemma 8.5 Suppose, in the notation of Lemma 8.4, that some value a appears 
as oii more than k times. Let T~ be T v but with only k of those subtrees. Then 
T v =h T~ . 

Proof. The game has only k moves so Spoiler cannot go in more than k of these 
subtrees. ■ 



Lemma 8.6 IfT v is a representative of a given = k -equivalence class with minimum 
possible order, then each vertex ofT v has at most kg{k) children. 

Proof. This easily follows from Lemmas 8.5 and 8.4 by induction on the depth. ■ 



Lemma 8.7 If T v is a representative of a given = k -equivalence class with minimum 
possible order, then it has depth at most g{k) — 1. 

Proof. Take a longest path from the root to a leaf. If it has more than g (k) vertices, 
it contains two vertices w and u such that u is a descendant of w and T v (u) = k Tv{w). 
Replacing T v {w) by T v {u), we obtain a smaller tree in the same =fc-class. ■ 

Lemma 8.2 immediately follows from Lemmas 8.6 and 8.7. 
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9 The smoothed succinctness function 



Let q(n) = q(n; all) denote the succinctness function for the class of all graphs. Since 
there are only finitely many pairwise inequivalent sentences of bounded quantifier 
rank, q(n) — > oo as n — > oo. We will show that q(n) grows very slowly and, in a 
sense, irregularly. We first summarize an information given by Theorems 4.2 and 6.1. 

Corollary 9.1 

1) There is no general recursive function f such that f(q(n)) > n for all n. 

2) There is no general recursive function l{n) such that l(n) is monotone nonde- 
creasing, l{n) — > oo as n — > oo, and l(n) < q(n) for all n. 

3) q(n) < log* n + 5. 

Proof. 1) Note q(n) < s(n) < s^(n). Now, if there were a general recursive function 
/ such that f(q(n)) > n, then we would have maXj< S3 („) f(i) > n contradictory to 
Theorem 4.2. 

2) Assume that a such l(n) exists. Let f(m) be the first value of % such that 
> m. Then f(q(n)) > n contradictory to Item 1. 

3) As any upper bound on q(n; C) is stronger if it is proved for a smaller class of 
graphs, this item is an immediate consequence of Theorem 6.1. ■ 

Definition 9.2 We define the smoothed succinctness function q*(n) (for quantifier 
rank) to be the least monotone nondecreasing integer function bounding q(n) from 
above, that is, q*{n) = max m <„g(m). 

Theorem 9.3 log* n - log* log* n - 0(1) < q*(n) < log* n + 5. 

Proof. Since the upper bound on q(n) given by Corollary 9.1 (3) is monotone, 
this is a bound on q*(n) as well. The lower bound is derivable from Lemma 3.3. 
This lemma states that \Ehrv(k)\ < T(k + 2 + log* k) + c for a constant c. Given 
n > c + T(3), let k be such that T(A; + 2 + log* k) + c< n < T(k + 3 + \og*(k + l)) + c. 
Assuming that n is sufficiently large, we have k > log* n — log* log* n — 4. According 
to Proposition 3.6, at most \Ehrv{k)\ graphs are definable with quantifier rank at 
most k. By the pigeonhole principle, there will be some m < \Ehrv{k) \ + 1 < n for 
which no graph of order precisely m is defined with quantifier rank at most k. We 
conclude that q*(n) > q(m) > k and hence q*(n) > log* n — log* log* n — 2. ■ 

We defined q*(n) to be the "closest" to q(n) monotone function. Notice that 
q{n) itself lacks the monotonicity. 

Corollary 9.4 q(i + 1) < q(i) for inhnitely many i. 

Proof. Set l(n) = log* n — log* log* n — 2. We have just shown that q*(n) > l(n) for 
all n large enough. By Corollary 9.1 (2), we have q{n) < l{n) for infinitely many n. 
For each such n, let m n < n be such that q{m n ) > l(n). Thus, q{m n ) > q(n) and a 
desired i must exist between m„ and n. ■ 
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For each non-negative integer a and for a = 1/2, define q a {n) = min| G | =n D a (G) 
and q* a (n) = max m < n g a (m). As easily seen, Corollary 9.1 (1) holds true for g 3 (n) as 
well. Note a strengthening of Corollary 9.1 (3) that follows from a result in another 
our paper. Let G(n,p) denote a random graph on n vertices distributed so that 
each edge appears with probability p and all edges appear independently from each 
other. 

Theorem 9.5 [12] With probability approaching 1 as n goes to the infinity, 

D 3 (G(n,n-^ 4 )) = log* n + O(l). 

Corollary 9.6 q^(n) < log*n + 0(l) and hence log* n — log* log* n — O ( 1 ) < q^(n) < 
log*n + 0(l). 

10 Depth vs. length 

Theorem 10.1 L(G) < T(D(G) + log* D(G) + 0(1)). 

Proof. Given an Ehrenfeucht value a, let 1(a) denote the shortest length of a 
formula defining a in the sense of Section 3. Define l(k) to be the maximum 1(a) 
over a G Ehrv(k) and l(k,s) the maximum 1(a) over a G Ehrv(k,s). Of course, 
l(k) = l(k,0). As in Section 3, f(k, s) = \Ehrv(k,s)\. 

It is not hard to see that L(G) < l(D(G)) and therefore it suffices to prove the 
bound l(k) < T(k + log* k + 0(1)) for all k > 2. 

On the account of Lemma 3.4, we have 

l(k, k) < 18 

and 

l(k, s) < f(k, s + l)(l(k, s + 1) + 10) 

if s < k. We will use these relations along with the bounds of Lemma 3.2 for f(k, s). 
Set g(x) = x2 x+1 . A simple inductive argument shows that 

f(k,s) <2^ s) ( 9fe2 ) and l(k,s) <g( k ~ s \9k 2 ). 

Since g(x) < A x , we have l(k, 0) < T A (k + 2 + log* k) < T(k + log* k + 0(1)), where 
T 4 stands for the variant of the tower function built from 4's instead of 2's. ■ 

Remark 10.2 Theorem 10.1 generalizes to structures over an arbitrary vocabulary. 
The proof requires only slight modifications. 

We now observe that the relationship between the optimum quantifier rank and 
length of defining formulas is nearly tight. 
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Theorem 10.3 There are infinitely many pairwise non- isomorphic graphs G with 
L{G) > T(D(G) - 6) - 0(1). 



Proof. The proof is given by a simple counting argument which can be naturally 
presented in the framework of Kolmogorov complexity (applications of Kolmogorov 
complexity for proving complexity-theoretic lower bounds can be found in [15]). 

Denote the Kolmogorov complexity of a binary word w by K{w). Let (G) denote 
the lexicographically first adjacency matrix of a graph G. Define the Kolmogorov 
complexity of G by K(G) = K((G}). Notice that 



The bound K(w) < k can hold for less than 2 k words. It follows that for some 
n < 2 k we have K(G) > k for all graphs G on n vertices. For this particular n we 
have 



Of course, we could run the same argument directly with L(G) in place of K(G). 
An advantage of using the Kolmogorov complexity is in avoiding estimation of the 
number of formulas of length at most k. 

In Section 5.1 we showed that prenex formulas are sometimes unexpectedly effi- 
cient in defining a graph. We are now able to show that, nevertheless, they generally 
cannot be competetive against defining formulas with no restriction on structure. 
Let ]JP renex [Q^ (resp. jj> renex (jG)~) denote the minimum quantifier rank (resp. length) 
of a closed prenex formula defining a graph G. 

Theorem 10.4 There are infinitely many pairwise non- isomorphic graphs G with 



K(G) < L(G) + 0(1). 



By Theorem 6.1, there is a graph G n on n vertices with 



D(G n ) < log*n + 5. 



(11) 



L(G n ) > logn - 0(1). 
Combining (11) and (12), we see that G n is as required. 



(12) 



DP™ nex (G) > T(D(G) - 8). 



Proof. Let G be as in Theorem 10.3. We have 



V renex {G) > L(G) > T(D(G) - 6) 



0(1). 



On the other hand, by Lemma 2.4 we have 



prenex 



(O) < f(D prenex (G)), where f(x) 



0(xH x2 ). 



It follows that 




provided D(G) (or the order of G) is sufficiently large. 
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11 Open questions 



1. Let D'{G) be the minimum quantifier rank of a first order sentence distinguish- 
ing a graph G from any non- isomorphic finite graph G' . Clearly, D'(G) < D(G). 
Can the inequality be sometimes strict? 

2. Improve on the alternation number in Theorem 4.2. The most interesting case 
is that of alternation number 0. By the Ramsey theorem, Turing machines cannot 
be simulated by 0-alternation formulas as this would contradict the unsolvability of 
the halting problem. Thus, an intriguing question is how small so(n) and qo(n) can 
be. 

3. Classify the prefix classes with respect to solvability of the finite satisfiability 
problem over graphs. Such a classification does exist by the Gurevich classifiability 
theorem [1, section 2.3]. In particular, can the prefix 3*yO(i) 3 o(i) y o(i) in Theorem 
5.10 be shortened to 3*V 0(1 * ) 3 ( 1 )? Shortening to 3*V* is impossible due to the 
Ramsey theorem. 

Note that for digraphs the complete classification is known (see [1] and references 
there). In notation of Section 5.4, the minimal undecidable classes for Sat^ n (V) 
are V*3, V3V*, V3V3*, V3*V, 3*V3V, 3*V C+1 3, V c+1 3*, while the maximal decidable 
classes are 3*V* and 3*V C 3*, where c = 1. For Satfi n (V) the classification is the same 
but with c = 2. If we consider Sat = (V) instead of Sat^jV) and Sat(V) instead 
of Satfi n (T>), nothing in classification changes. The reasons are that the maximal 
decidable classes have the finite model property and that the undecidability of the 
minimal undecidable classes is proved by reductions which preserve the finiteness of 
models. 

4. How close to one another are Di(G) and D (G)7 At least, are they recursively 
linked? The same question for D(G) and D a (G) (any a = o(n) is of interest). How 
far apart from one another can be D(G) and D 1 (G)? 

5. Estimate the succinctness function q(n; C) for other classes of graphs (in par- 
ticular, graphs of bounded degree, planar graphs). 

6. Is q{n) a non-recursive function? Is D{G) an incomputable function of graphs 
(T. Luczak)? Of course, the former implies the latter. The same can be asked for 
q a (n) and D a (G) excepting a e {0, 1/2} (see Theorem 5.8). 

7. We know that qlin) = (1 + o(l))log*n. The cases of alternation number 0, 
1, and 2 are open. 

8. \q(n + 1) — q(n)\ = 0(1)? Note g(n + 1) — g(n) < 1 but this difference is 
negative infinitely often by Corollary 9.4. 

9. Can one construct a family of graphs G as in Theorem 10.3 explicitly? 
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